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Abstract 

We introduce an extended tableau calculus for answer set programming (ASP). The proof 
system is based on the ASP tableaux defined in [Gebser&Schaub, ICLP 2006], with an 
added extension rule. We investigate the power of Extended ASP Tableaux both theo- 
retically and empirically. We study the relationship of Extended ASP Tableaux with the 
Extended Resolution proof system defined by Tseitin for sets of clauses, and separate Ex- 
tended ASP Tableaux from ASP Tableaux by giving a polynomial-length proof for a family 
of normal logic programs {Hn} for which ASP Tableaux has exponential-length minimal 
proofs with respect to n. Additionally, Extended ASP Tableaux imply interesting insight 
into the effect of program simplification on the lengths of proofs in ASP. Closely related 
to Extended ASP Tableaux, we empirically investigate the effect of redundant rules on 
the efficiency of ASP solving. 

KEYWORDS: Answer set programming, tableau method, extension rule, proof complexity, 
problem structure 



1 Introduction 

Answer set programming (ASP) (Marek and Truszczynski 1999; Niemela 1999; 
Gelfond and Leone 2002; Lifschitz 2002; Baral 2003) is a declarative problem solving 
paradigm which has proven successful for a variety of knowledge representation and 
reasoning tasks (see (Soininen et al. 2001; Nogueira et al. 2001; Erdem et al. 2006; 
Brooks et al. 2007) for examples). The success has been brought forth by efficient 
solver implementations such as smodels (Simons et al. 2002), div (Leone et al. 2006), 
noMore++ (Anger et al. 2005), cmodels (Giunchiglia et al. 2006), assat (Lin and 
Zhao 2004), and clasp (Gebser et al. 2007). However, there has been an evident lack 
of theoretical studies into the reasons for the efficiency of ASP solvers. 

Solver implementations and their inference techniques can be seen as determinis- 
tic implementations of the underlying rule-based proof systems. A solver implements 

* This is an extended version of a paper (Jarvisalo and Oikarinen 2007) presented at the 23rd 
International Conference on Logic Programming (ICLP 2007) in Porto, Portugal. 
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a particular proof system in the sense that the propagation mechanisms appHed by 
the solver apply the deterministic deduction rules in the proof system, whereas the 
nondeterministic branching/splitting rule of the proof system is made deterministic 
through branching heuristics present in typical solvers. From the opposite point of 
view, a solver can be analyzed by investigating the power of an abstraction of the 
solver as the proof system the solver implements. Due to this strong interplay be- 
tween theory and practice, the study of the relative efficiency of these proof systems 
reveals important new viewpoints and explanations for the successes and failures 
of particular solver techniques. 

A way of examining the best-case performance of solver algorithms is provided 
by (propositional) proof complexity theory (Cook and Reckhow 1979; Beame and 
Pitassi 1998), which concentrates on studying the relative power of the proof sys- 
tems underlying solver algorithms in terms of the shortest existing proofs in the 
systems. A large (superpolynomial) difference in the minimal length of proofs avail- 
able in different proof systems for a family of Boolean expressions reveals that solver 
implementations of these systems are inherently different in strength. While such 
proof complexity theoretic studies are frequent in the closely related field of propo- 
sitional satisfiability (SAT), where typical solvers have been shown to be based on 
refinements of the well-known Resolution proof system (Beame et al. 2004), this has 
not been the case for ASP. Especially, the inference techniques applied in current 
state-of-the-art ASP solvers have been characterized by a family of tableau-style 
ASP proof systems for normal logic programs only very recently (Gebser and Schaub 
2006b), with some related proof complexity theoretic investigations (Anger et al. 
2006) and generalizations (Gebser and Schaub 2007). The close relation of ASP 
and SAT and the respective theoretical underpinning of practical solver techniques 
has also received little attention up until recently (Giunchiglia and Maratea 2005; 
Gebser and Schaub 2006a), although the fields could gain much by further studies 
on these connections. 

This work continues in part bridging the gap between ASP and SAT. Influenced 
by Tseitin's Extended Resolution proof system (Tseitin 1969) for clausal formu- 
las, we introduce Extended ASP Tableaux, an extended tableau calculus based on 
the proof system in (Gebser and Schaub 2006b). The motivations for Extended 
ASP Tableaux are many-fold. Theoretically, Extended Resolution has proven to 
be among the most powerful known proof systems, equivalent to, for example, ex- 
tended Frege systems; no exponential lower bounds for the lengths of proofs are 
known for Extended Resolution. We study the power of Extended ASP Tableaux, 
showing a tight correspondence with Extended Resolution. 

The contributions of this work are not only of theoretical nature. Extended ASP 
Tableaux is in fact based on adding structure into programs by introducing addi- 
tional redundant rules. On the practical level, the structure of problem instances has 
an important role in both ASP and SAT solving. Typically, it is widely believed that 
redundancy can and should be removed for practical efficiency. However, the power 
of Extended ASP Tableaux reveals that this is not generally the case, and such 
redundancy removing simplification mechanisms can drastically hinder efficiency. 
In addition, we contribute by studying the effect of redundancy on the efficiency of 
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a variety of ASP solvers. The results show that the role of redundancy in programs 
is not as simple as typically believed, and controlled addition of redundancy may in 
fact prove to be relevant in further strengthening the robustness of current solver 
techniques. 

The rest of this article is organized as follows. After preliminaries on ASP and 
SAT (Section 2), the relationship of Resolution and ASP Tableaux proof systems 
and concepts related to the complexity of proofs are discussed (Section 3). By in- 
troducing the Extended ASP Tableaux proof system (Section 4), proof complexity 
and simplification are then studied with respect to Extended ASP Tableaux (Sec- 
tion 5). Experimental results related to Extended ASP Tableaux and redundant 
rules in normal logic programs are presented in Section 6. 

2 Preliminaries 

As preHminaries we review basic concepts related to answer set programming (ASP) 
in the context of normal logic programs, propositional satisfiability (SAT), and 
translations between ASP and SAT. 

2.1 Normal Logic Programs and Stable Models 

We consider normal logic programs (NLPs) in the propositional case. In the following 
we will review some standard concepts related to NLPs and stable models. 
A normal logic program 11 consists of a finite set of rules of the form 

r : h ^ ai,. . . ,an,^bi,. . . ,^brn, (1) 

where each aj and bj is a propositional atom, and h is either a propositional atom, 
or the symbol _L that stands for falsity. A rule r consists of a head, head(r) = h, 
and a body, body(r) = {04, . . . , a }. The symbol "~" denotes default 

negation. A default literal is an atom a, or its default negation ~a. 
The set of atoms occurring in a program 11 is atom (11), and 

dlit(n) = {a, ~a I a e atom(n)} 

is the set of default literals in 11. We use the shorthands L"*" = {a | a G L} and 
L~ = {a \ G L} for a set L of default literals, and ~^ = {^a \ a G A} for a 
set A of atoms. This allows the shorthand 

head(r) ■*— body(r)~'~ U~body(r)~ 

for (1). A rule r is a fact if body(r) = 0. Furthermore, we use the shorthands 

head(n) = {head(r) | r G 11} and 
body(n) = {body(r) | r G H}. 

In ASP, we are interested in stable models (Gelfond and Lifschitz 1988) (or answer 
sets) of a program 11. An interpretation M C atom(n) defines which atoms of 11 
are true (a G M) and which are false (a ^ M). An interpretation M C atom (IT) is 
a (classical) model of 11 if and only if body(r)"'" C M and body(r)~ fl M = imply 
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head(r) € M for each rule r € 11. A model M of a program 11 is a stable model 
of n if and only if there is no model M' C M of 11^, where 

= {head(r) ^ body(r)+ | r G H and body(r)- n Af = 0} 

is called the Gelfond-Lifschitz reduct of 11 with respect to M. We say that a pro- 
gram n is satisfiable if it has a stable model, and unsatisfiable otherwise. 

The positive dependency graph of 11, denoted by Dep~'~(n), is a directed graph 
with atom(n) and 

{{b,a) I 3r e n such that b = head(r) and a G body(r)~''} 

as the sets of vertices and edges, respectively. A non-empty set L C atom(n) is a 
loop in Dep^(n) if for any a,b £ L there is a path of non-zero length from a to 6 
in Dep~'~(n) such that all vertices in the path are in L. We denote by loop(n) the 
set of all loops in Dep"'"(n). A NLP is tight if and only if loop(n) = 0. Furthermore, 
the external bodies of a set A of atoms in 11 is 

eh{A) = {body(r) | r e 11, head(r) e A, body(r)+ n ^ = 0}. 

A set ?7 C atom(n) is unfounded if eb{U) = 0. We denote the greatest unfounded 
set, that is, the union of all unfounded sets, of n by gus(n). 

A splitting set (Lifschitz and Turner 1994) for a NLP 11 is any set U C atom (11) 
such that for every r G 11, if head(r) G U, then body(r)+ U body(r)~ C U. The 
bottom of n relative to U is 

bottom(n, U") = {r G n I atom({r}) C U}, 

and the top of 11 relative to U is 

top(n, ?7) = n \ bottom (n, U). 

The top can be partially evaluated with respect to an interpretation X C U. The 
result is a program eval(top(n, C/), X) that contains the rule 

head(r) ^ (body(r)+ \ U), ~(body(r)- \ U) 

for each r G top(n, U) such that body(r)+ (lU C X and (body(r)" n C/) fl X = 0. 
Given a splitting set U for a NLP 11, a solution to IL with respect to /7 is a pair {X, Y) 
such that X CU,Y C atom(n) \ [/, X is a stable model of bottom (11, U), and Y 
is a stable model of eval(top(n, U),X). In this work we will apply the splitting set 
theorem (Lifschitz and Turner 1994) that relates solutions with stable models. 

Theorem 2.1 ((Lifschitz and Turner 1994)) Given a normal logic program 11 
and a splitting set U for H, an interpretation M C atom(n) is a stable model of II 
if and only if {M nU, M \ U) is a solution to H with respect to U. 

2.2 Propositional Satisfiability 

Let AT be a set of Boolean variables. Associated with every variable x € X there 
are two literals, the positive literal, denoted by x, and the negative literal, denoted 
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by X. A clause is a disjunction of distinct literals. We adopt the standard convention 
of viewing a clause as a finite set of literals and a CNF formula as a finite set of 
clauses. The set of variables appearing in a clause C (a set C of clauses, respectively) 
is denoted by var(C) (var(C), respectively). 

A truth assignment t associates a truth value t{x) € {false, true} with each 
variable x € X. A truth assignment satisfies a set of clauses if and only if it satisfies 
every clause in it. A clause is satisfied if and only if it contains at least one satisfied 
literal, where a literal x {x, respectively) is satisfied if t{x) = true (t(x) = false, 
respectively). A set of clauses is satisfiable if there is a truth assignment that satisfies 
it, and unsatisfiable otherwise. 



2.3 SAT as ASP 

There is a natural linear-size translation from sets of clauses to normal logic pro- 
grams so that the stable models of the encoding represent the satisfying truth 
assignments of the original set of clauses faithfully, that is, there is a bijective 
correspondence between the satisfying truth assignments and stable models of the 
translation (Niemela 1999). Given a set C of clauses, this translation nlp(C) in- 
troduces a new atom c for each clause C G C, and atoms ax and dx for each 
variable x G var(C). The resulting NLP is then 

nlp(C) = {ux <- dx ^ ^Ux \ x € var(C)} U (2) 

{_L ^ ~c I C G C} U (3) 

{c^ ax\x eC, C gC, x e var(C)} U (4) 

{c ~ax \ x gC, C gC, X G var(C)}. (5) 

The rules (2) encode that each variable must be assigned an unambiguous truth 
value, the rules in (3) that each clause in C must be satisfied, while (4) and (5) 
encode that each clause is satisfied if at least one of its literals is satisfied. 

Example 2.2 The set C = {{x,y}, {x,y}, {x,y}, {x,y}} of clauses is represented 
by the normal logic program 

nlp(C) = {ax ^ ^dx- dx ^ ^a^- ay •*— ~ay. dy <— '^Uy. 

_L ^ ^Ci. ± ^ ~C2. -L <— ~C3. ± <— ~C4. 

ci ^ ax- ci ^ ay. C2 ^ ax- C2 ^ ^ay. 

C3 ^ ~aa;. C3 ^ ay. Ci ^ ^ttx- C4 ^ ^ay }. 



2.4 ASP as SAT 

Contrarily to the case of translating SAT into ASP, there is no modular^ and faith- 
ful translation from normal logic programs to prepositional logic (Niemela 1999). 

^ Intuitively, for a modular translation, adding a set of facts to a program leads to a local change 
not involving the translation of the rest of the program (Niemela 1999). 
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Moreover, any faithful translation is potentially of exponential size when additional 
variables are not allowed (Lifschitz and Razborov 2006)^. However, for any tight 
program 11 it holds that the answer sets of 11 can be characterized faithfully by the 
satisfying truth assignments of a linear-size propositional formula called Clark's 
completion (Clark 1978; Pages 1994) of H, defined using a Boolean variable Xa for 
each a £ atom (11) as 

c(n) = /\ v ( A A ^0)' 

heatom(n)u{±} V rGrule(/i) V 6ebody(r)+ f)Gbody(r)- // 

where rule(/i) = {r G 11 | head(r) = h}. Notice that there are the special cases 

(i) if /i is _L then the equivalence becomes the negation of the right hand side, 

(ii) if ft- is a fact, then the equivalence reduces to the clause {xh}, and (iii) if an 
atom h does not appear in the head of any rule then the equivalence reduces to the 
clause {xh}. 

In this work, we will consider the clausal representation of Boolean formulas. 
A linear-size clausal translation of C (U) is achieved by introducing additionally a 
new Boolean variable Xb for each B e body(n). Using the new variables for the 
bodies, we arrive at the clausal completion 



com 



p(n) = U A A U ^^^""^^ 

Bebody(n) I aeB+ beB- J Bebody(rule(-L)) 

U D lxh= y xb\ (8) 

/iehead(n)\{_L} I BGbody(rule(/»)) J 

U U (9) 

aeatom(n)\head(n) 

where the shorthands x = Ax ex -"^i 2; = \J x ex -^i stand for the sets of clauses 
{x,xi, . . .,Xn} U {Jxiexi^'^i} and {Jxiexi^'^i} ^ • • • i^^n}, respectively. 



Example 2.3 For the normal logic program 11 = {a ■*— 6, ^a. 6 <— c. c ■*— the 
clausal completion is 

COmp(n) = {{X{b,r^a},Xa,Xb},{x{b^r^a},Xa},{x{b^^a},Xb}, 

{Xa, X{b,r^a}}, {Xb, X{c}}, {Xb, X{c}}, {Xc, ^{-6}}}, {^c, X{r^b}}- 



^ However, polynomial-size propositional encodings using extra variables are known, see (Ben- 
Eliyahu and Dechter 1994; Lin and Zhao 2003; Janhunen 2006). Also, ASP as Propositional 
Satisfiability approaches for solving normal logic programs have been developed, for example, 
assat (Lin and Zhao 2004) (based on incrementally adding — possibly exponentially many — loop 
formulas) and asp-sat (Giunchiglia et al. 2006) (based on generating a supported model (Brass 
and Dix 1995) of the program and testing its minimality — thus avoiding exponential space 
consumption). 
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3 Proof Systems for ASP and SAT 

In this section we review concepts related to proof complexity (Cook and Reck- 
how 1979; Beame and Pitassi 1998) in the context of this work, and discuss the 
relationship of Resolution and ASP Tableaux (Gebser and Schaub 2006b) . 

3.1 Propositional Proof Systems and Complexity 

Formally, a (propositional) proof system is a polynomial-time computable predi- 
cate S such that a propositional expression E is unsatisfiable if and only if there 
is a proof P for which S{E,P) holds. A proof system is thus a polynomial-time 
procedure for checking the correctness of proofs in a certain format. While proof 
checking is efficient, finding short proofs may be difficult, or, generally, impossible 
since short proofs may not exist for a too weak proof system. As a measure of 
hardness of proving unsatisfiability of an expression £^ in a proof system S, the 
(proof) complexity of E in S is the length of the shortest proof for E in S. For a 
family {E,, } of unsatisfiable expressions over increasing number of variables, the 
(asymptotic) complexity of {En} is measured with respect to the sizes of £"„. 

For two proof systems S and S', we say that S' polynomially simulates S if 
for all families {En} it holds that Cs'{En) < p{Cs{En)) for all En, where p is a 
polynomial, and C's and C's' arc the complexities in S and S', respectively. If S 
simulates S' and vice versa, then S and S' are polynomially equivalent. If there is 
a family {En} for which S' does not polynomially simulate S, we say that {En} 
separates S from S'. If S simulates S", and there is a family {En} separating S 
from S' , then S is more powerful than S'. 

3.2 Resolution 

The well-known Resolution proof system (RES) for sets of clauses is based on the 
resolution rule. Let C, D be clauses, and x a Boolean variable. The resolution rule 
states that we can directly derive C U D from {a;} U C and {x} U by resolving 
on X. 

A RES derivation of a clause C from a set C of clauses is a sequence of clauses 
TT = (Ci, C2, . . . , C„), where Cn = C and each Ci, where 1 < i < n, is either 
(i) a clause in C (an initial clause), or (ii) derived with the resolution rule from two 
clauses Cj. Ck, where j, k < i {a derived clause). The length of tt is n, the number 
of clauses occurring in it. Any derivation of the empty clause from C is a RES 
proof for (the unsatisfiability of) C. 

Any RES proof tt = (Ci, C2, • ■ • , C„ =0) can be represented as a directed acyclic 
graph, in which the leafs are initial clauses and other nodes are derived clauses. 
There are edges from C, and Cj to Ck if and only if Ck has been directly derived 
from Ci and Cj using the resolution rule. Many Resolution refinements, in which 
the structure of the graph representation is restricted, have been proposed and 
studied. Of particular interest here is Tree-like Resolution (T-RES), in which it is 
required that proofs are represented by trees. This impHes that a derived clause. 
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if subsequently used multiple times in the proof, must be derived anew each time 
from initial clauses. 

T-RES is a proper RES refinement, that is, RES is more powerful than T-RES (Ben- 
Sasson et al. 2004). On the other hand, it is well known that the DPLL method (Davis 
and Putnam 1960; Davis et al. 1962), the basis of most state-of-the-art SAT solvers, 
is polynomially equivalent to T-RES. However, conflict-learning DPLL is more pow- 
erful than T-RES, and polynomially equivalent to RES under a slight generaliza- 
tion (Beame et al. 2004). 

3.3 ASP Tableaux 

Although ASP solvers for normal logic programs have been available for many years, 
the deduction rules applied in such solvers have only recently been formally deflned 
as a proof system, which we will here refer to as ASP Tableaux or ASP-T (Gebser 
and Schaub 2006b). 

An ASP tableau for a NLP 11 is a binary tree of the following structure. The 
root of the tableau consists of the rules 11 and the entry F_L for capturing that _L 
is always false. The non-root nodes of the tableau are single entries of the form Ta 
or Fa, where a G atom(n) U body(n). As typical for tableau methods, entries are 
generated by extending a branch (a path from the root to a leaf node) by applying 
one of the rules in Figure 1; if the prerequisites of a rule hold in a branch, the 
branch can be extended with the entries specifled by the rule. For convenience, we 
use shorthands tl and fl for default literals: 

_ (Ta, if ? = a is positive, 

[Fa, if I = ~a is negative; and 

_ fTa, if Z = is negative, 
\Fa, if Z = a is positive. 

A branch is closed under the deduction rules (b)-(i) if the branch cannot be 
extended using the rules. A branch is contradictory if there are the entries Ta 
and Fa for some a. A branch is complete if it is contradictory, or if there is the 
entry Ta or Fa for each a G atom(n) U body(n) and the branch is closed under 
the deduction rules (b)-(i). A tableau is contradictory, if all its branches in are 
contradictory, and nan- contradictory otherwise. A tableau is complete if all its 
branches are complete. A contradictory tableau from 11 is an ASP-T proof for (the 
unsatisflability of) 11. The length of an ASP-T proof is the number of entries in it. 

Example 3.1 An ASP-T proof for the NLP H = {a ^ b, ^a. b c. c <— ~6} is 
shown in Figure 2, with the rule applied for deducing each entry given in paren- 
theses. For example, the entry Fa has been deduced from a <— b, ~a in H and the 
entry T{6, ^a} in the left branch by applying the rule (g) Backward True Body. On 
the other hand, T{6, ^a} has been deduced from a ^ b, ^a in 11 and the entry Ta 
in the left branch by applying the rule (i^), that is, rule (i) by the fact that the 
condition § "Backward True Atom" is fulfilled (in n, the only body with atom a in 
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T(t> I F0 
(a) Cut 



h < /i , . . . , In 
t/l , . . . , tiln 

T{h,...,l„} 
(b) Forward True Body 

h li , /„, 

T{h,...,L} 



Th 

(d) Forward True Atom 



ti , . . . , t j , . 
fh 



) in 



F{ll, . . . . . . , In} 

(f) Forward False Body 



F{h, . . . ,li, . . . ,ln} 

Hi , . . . , t/j_i , t/j_|_i , . . . , Hji 
fk 

(c) Backward False Body 

h <— h, . . . ,ln 
Fh 



F{ll,...,ln} 

(e) Backward False Atom 



T{/i,. 



■,ln} 



tli 

(g) Backward True Body 



FBi,. 



, FBri 



(b) 



Fh 

(h) 



FBi, . . . , Fi?i_i, FBj+i, . . . , FBr, 
Th 

TBi 
(i) 



(H) 



(t]): Applicable when (f) £ atom(n) U body(n). 

(b): Applicable when one of the following conditions holds: 

§ ("Forward False Atom"), f ("Well- Founded Negation"), or J ("Forward Loop"), 
(jj): Applicable when one of the following conditions holds: 

§ ("Backward True Atom"), f ("Well- Founded Justification"), or J; ("Backward Loop"). 
(§): Applicable when body(rule(/i)) = {Si, . . . ,Bm}- 
(t): Applicable when 

{Bi,...,Bm} C body(n) and /i Ggus({r e n [ body(r) {Bi, . . . B™}}). 
(t): Applicable when h e L, L € loop(n), and eb(L) = {Bi, Bm} all hold. 

Fig. 1. Rules in ASP Tableaux. 



the head is {6, ~o}j. The tableau in Figure 2 has two closed branches: 
(nu{F_L},Ta,T{6,~a},Fa) and 

(n U {F_L}, Fa, F{6, ~a}, F6, T{~6}, Tc, T{c}, T6). 
These branches share the common prefix (11 U {F_L}). 

Any branch B describes a partial assignment A on atom (11) U body (11) in a natural 
way, that is, if there is an entry Ta (Fa, respectively) in i? for a £ atom(n) U 
body(n), then (a, true) € A ((a, false) € A, respectively). ASP-T is a sound and 
complete proof system for normal logic programs, that is, there is a complete non- 



10 



M. Jdrvisalo and E. Oikarinen 



b^c 
c^^b 




T{6,~a} (i§) F{6,^a} (e) 



(g) ¥b 


(c) 


T{^6} 


(b) 


Tc 


(d) 


T{c} 


(b) 


T6 


(d) 



X 



Fig. 2. An ASP-T proof for 11 = {a <— 6, ~a. 6 <— c. c <— ~6}. 



contradictory ASP tableau from 11 if and only if 11 is satisfiable (Gebser and Schaub 
2006b). Thus the assignment A described by a complete non-contradictory branch 
gives a stable model M = {a e atom (11) | (a, true) G A} of 11. 

As argued in (Gebser and Schaub 2006b), current ASP solver implementations 
are tightly related to ASP-T, with the intuition that the cut rule is made determin- 
istic with decision heuristics, while the deduction rules describe the propagation 
mechanism in ASP solvers. For instance, the noMore++ system (Anger et al. 2005) 
is a deterministic implementation of the rules (a)-(g),(h§),(hf ), and (i§), while 
smodels (Simons et al. 2002) applies the same rules with the cut rule restricted 
to atom(n). 

Interestinglj', ASP-T and T-RES are polynomially equivalent under the trans- 
lations comp and nip. Although the similarity of unit propagation in DPLL and 
propagation in ASP solvers is discussed in (Giunchiglia and Maratea 2005; Gebser 
and Schaub 2006a), here we want to stress the direct connection between ASP-T 
and T-RES. In detail, T-RES and ASP-T are equivalent in the sense that (i) given an 
arbitrary NLP 11, the length of minimal T-RES proofs for comp(n) is polynomially 
bounded in the the length of minimal ASP-T proofs for 11, and (ii) given an arbi- 
trary set C of clauses, the length of minimal ASP-T proofs for nlp(C) is polynomially 
bounded in the length of minimal T-RES proofs for C. 

Theorem 3.2 T-RES and ASP-T are polynomially equivalent proof systems in the 
sense that 

considering tight normal logic programs, T-RES under the translation comp polyno- 
mially simulates ASP-T, and 

considering sets of clauses, ASP-T under the translation nip polynomially simulates 
T-RES. 

In the following we give detailed proofs for the two parts of Theorem 3.2 followed 
by illustrating examples. 

In the proof of the first part of Theorem 3.2, we use a concept of a (binary) cut 
tree corresponding to an ASP-T proof. Given an ASP-T proof T for a normal logic 
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program 11, the corresponding cut tree is obtained as follows. Starting from the 
root of T, we replace each non-leaf entry generated by a deduction rule in T by an 
application of the cut rule on the corresponding entry. For example, the cut tree T' 
corresponding to the ASP-T proof T in Figure 2 is given in Figure 3 (left). 

Proof of Theorem 3.2 (i) 

Let T be an ASP-T proof for a tight normal logic program 11. Without loss of 
generality, we will assume that branches in T have not been extended further after 
they have become contradictory. We now show that we can construct a T-RES 
proof TT for comp(n) using the cut tree T' corresponding to T. Furthermore, we 
show that for such a proof tt it holds that, given any prefix p of an arbitrary 
branch B in T' there is a clause C G n contradictory to the partial assignment 
in p, that is, there is the entry Fo (Ta) for a G atom(n) U body(n) in p for each 
corresponding positive literal Xa (negative literal Xa) in C. 

Consider first the partial assignment in an arbitrary (full) branch B inT'. Assume 
that there is no clause in comp(n) contradictory to the partial assignment in B, 
that is, we can obtain a truth assignment r based on the entries in B such that every 
clause in comp(n) is satisfied in r. But this leads to contradiction since comp(n) is 
satisfied if and only if 11 is satisfied. Thus there is a clause C € comp(n) contradic- 
tory to the partial assignment in B, and we take the clause C into our resolution 
proof TT. 

Assume that we have constructed tt such that for any prefix p of length n for any 
branch B in T', there is a clause C € tt contradictory to the partial assignment in p. 
Consider an arbitrary prefix p of length n — 1. Now, in T' we have the prefixes p' 
and p" of length n which have been obtained through extending p by applying the 
cut rule on some a G atom(n) U body(n). In other words, p' is p with Ta appended 
in the end {p" is p with Fa appended in the end). Since p' {p" , respectively) is 
of length n, there is a clause C {D, respectively) in tt contradictory to the partial 
assignment in p' (p" , respectively). Now there are two possibilities. If C = {xa} UC" 
and D = {xa} U D', we can resolve on Xa adding C U D' to tt. Thus we have a 
clause C'UD' e tt contradictory to the partial assignment in the prefix p. Otherwise 
we have that Xa ^ C or Xa ^ D, and hence either C e tt or D G tt is contradictory 
to the partial assignment in the prefix p. 

When reaching the root of T', we must have derived since it is the only clause 
contradictory with the empty assignment. Furthermore, the T-RES derivation tt is 
of polynomial length with respect to T' (and T) . □ 

The following example illustrates the RES proof construction used above in the 
proof of Theorem 3.2 (i). 

Example 3.3 Consider again the tight NLP H = {a ^ b, ~a. b ^ c. c ^ '^6} 
from Example 2.3 and the ASP-T proof T for 11 in Figure 2. We now construct 
a T-RES proof for the completion comp(n) (see Example 2.3 for details) using 
the strategy from the proof of Theorem 3.2 (i). First, T is transformed into a cut 
tree T' given in Figure 3 (left). Consider now the two leftmost branches in T' . The 
partial assignment in the branch with entries Ta and F{&, ~a} is contradictory 
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{Xa}' 



{Xa} 





F{6, r^a} 


\ T{6, ~a} 


F5 




T{~6} 




Tc 


Fc 


T{c} 





{Xa.Xsi, a.\ } {^{6,~a} . Xa){x{h,r^a) , Xa} {Xa,X^^„^a}} 

{Xb} {x^b^^a}>Xa,Xb} 



{Xb,X[^ty} {x[^b},Xb} 
{Xb,Xc} {xc,Xl^b}} 



{xb,X{c}} {x{c},Xc} 



Fig. 3. Left: cut tree based on the ASP-T proof in Figure 2. Right: resulting T-RES proof. 



to clause {xajX^b^^a}} in comp(n), and the partial assignment in the branch with 
entries To, and T{6, ^a} is contradictory to clause {.X{5,^a}j ^a} in comp(n). Thus 
we resolve on x^fj ^-^^} 0,'^'^ obtain the clause {xa}, which is contradictory to the 
single entry To in the prefix of the two leftmost branches in T' . Similarly, we can 
construct a resolution tree for clause {xa} corresponding to the right side ofT'. We 
finish the proof by resolving on Xa- The complete T-RES proof corresponding to the 
cut tree T' is shown in Figure 3 (right). 



Proof of Theorem 3.2 (ii) 

Let TT = (Ci, . . . , C„ =0) be a T-RES refutation of a set C of clauses. Recall that 
each derived clause Ci in tt is obtained by resolving on x from Cj = C U {x} and 
Ck = DU {x} for some j, k < i. 

An ASP-T proof T for nlp(C) is obtained from tt as follows. We start from C„, 
which is obtained from clauses Cj = {x} and Ck = {x} by resolving on a; € var(C), 
and apply in T the cut rule on ax corresponding to a;. Then we recursively continue 
the same way with Cj {Ck, respectively) in the generated branch with Fa^ (Ta^, 
respectively). Since tt is tree-like, each clause in the prefix (Ci, . . . , Cmax{j,fe}) of tt 
is either used in the derivation of Cj or Ck, but not in both. By construction when 
reaching Ci the branches of T correspond one-to-one to the paths in tt (seen as a 
tree) from C„ to the leaf clauses of tt. For a particular leaf clause C, we have for 
each literal I e C {I = x or I = x) contradicting entries for Ux in the corresponding 
branch of T, that is, Fax if I = x and Tax if I = x. Now we can directly deduce 
for each Fax the entry F{ax} and for each Tax the entry Fj^ax}- These entries 
together will allow us to directly deduce Fc (all the bodies of rules with atom c as 
the head are false). Since we have _L ^ ~c e nlp(C), we can deduce Tc, and the 
branch becomes contradictory. □ 



The following example illustrates the strategy used in the proof of Theorem 3.2 (ii). 
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Example 3.4 Recall the set C = {{x,y}, {x,y}, {x,y}, {x,y}} of clauses and the 
corresponding normal logic program nlp(C) presented in Example 2.2. The set C of 
clauses has a T-RES refutation n = {{x,y}, {x,y}, {x,y}, {x,y}, {y}, {y},9). Now 
we construct an ASP-T proof T for nlp(C) from tt as done in the proof of Theo- 
rem 3.2 (a). The resulting ASP-T proof T is presented in Figure 4- In the tableau 
we have omitted entries of the form T{1} and F{Z} for bodies consisting of a single 
default literal. The empty clause is obtained resolving on y from {y} and {y}, and 
thus we start with applying the cut rule on ay. The clause {y} is obtained resolving 
on X from {x,y} and {x,y\. We continue in the branch with Tay by applying the 
cut rule on Ox- Consider now the branch with Tay and Ta^ in the tableau. The 
branch corresponds to the clause {x,y} in C. Thus we arrive in a contradiction 
by deducing Fc4 from C4 <— ~aa; and C4 ^ '^o.y, and Tc4 from -L <— ~C4. Other 
branches become contradictory similarly. 



nlp(C) 

F_L 




Fc4 Fc2 Fc3 Fci 

TC4 TC2 TC3 Tci 

X XX X 



Fig. 4. An ASP-T proof for nlp(C) resulting from a T-RES proof 
TT = ({a;, y}, {x, y}, {x, y}, {x, y}, {y}, {y}, 0) for C in Example 3.4. 



4 Extended ASP Tableaux 

We will now introduce an extension rule^ to ASP-T, which results in Extended ASP 
Tableaux (E-ASP-T), an extended tableau proof system for ASP. The idea is that 
one can define names for conjunctions of default literals. 

Definition 4.1 Given a normal logic program IT and two literals hjh G dlit(n), 
the (elementary) extension rule in E-ASP-T adds the rule p <— li,l2 to H, where 
p atom(n) U 

It is essential that p is a new atom for preserving satisfiability. After an appHcation 
of the extension rule one considers the program E' = IIU {p ^ /i, Z2} instead of the 
original program 11. Notice that atom (11') = atom(n)U{p}. Thus when the extension 
rule is applied several times, the atoms introduced in previous applications of the 
rule can be used in defining further new atoms (see the forthcoming Example 4.2). 

^ Notice that the extension rule introduced here differs from the one proposed in (Hai et al. 2003) 
in the context of theorem proving. 
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When convenient, we will apply a generalization of the elementary extension rule. 
By allowing one to introduce multiple bodies for p, the general extension rule adds 
a set of rules 

[j{p ^ h,i, • • • , h,ki I P ^ atom(n) U {_L} and k^k e dlit(n) for all 1 < fc < fcj} 

i 

into n. Notice that equivalent constructs can be introduced with the elementary 
extension rule. For example, bodies with more than two literals can be decomposed 
with balanced parentheses using additional new atoms. 

Example 4.2 Consider a normal logic program H such that atom(n) = {a,b}. 
We apply the general extension rule and add a definition for the disjunction of 
atoms a and b, resulting in a program 11 U {c <— a. c <— 6}. An equivalent construct 
can be introduced by applying the elementary extension rule twice: add first a rule 
d ^ ~a, ^b, and then add a rule c ^ ^d, ^d. 

An E-ASP-T proof for (the unsatisfiability of) a program 11 is an ASP-T proof 
T for n U -E, where E is a. set of extending (program) rules generated with the 
extension rule in E-ASP-T. The length of an E-ASP-T proof is the length of T plus 
the number of program rules in E. 

A key point is that applications of the extension rule do not affect the existence 
of stable models. 

Theorem 4.3 Extended ASP Tableaux is a sound and complete proof system for 
normal logic programs. 

Proof 

Let T be an E-ASP-T proof for normal logic program 11 with the set E of extend- 
ing rules, that is, an ASP-T proof for HUE. Since ASP-T is sound and complete, 
there is a complete non-contradictory branch in T if and only if 11 U is satisfi- 
able. The set atom(n) is a splitting set for ILU E, since head(r) ^ atonn(n) U {±} 
for every extending rule r G E. Furthermore, bottom(n U i5, atom(n)) =11 and 
top(n U E, atom(n)) = E. By Theorem 2.1, H U is satisfiable if and only if there 
is a solution to 11 U S with respect to atom (11), that is, there is a stable model 
M C atom(n) for 11 and a stable model N for e\/a\{E, M). Since the rules in E are 
generated using the extension rule (recall also _L ^ head(i?)), there is a unique stable 
model for eval(i5, M) for each M C atom(n). Thus there is a solution to IlUi? with 
respect to atom(n) if and only if 11 is satisfiable, and moreover, IIUS is satisfiable 
if and only if 11 is satisfiable, and E-ASP-T is sound and complete. □ 

4-1 The Extension Rule and Well- Founded Deduction 

An interesting question regarding the possible gains of applying the extension rule 
in E-ASP-T with the ASP tableau rules is whether the additional extension rule 
allows one to simulate well-founded deduction (rules (ht),(14),(it), and {\\)) with 
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the other deduction rules ((b)-(g),(h§),(i§))^. We now show that this is not the case; 
the extension rule does not allow us to simulate reasoning related to unfounded 
sets and loops. This is implied by Theorem 4.4, which states that, by removing 
rules (ht),(h|),(it), and (ij) from E-ASP-T, the resulting tableau method becomes 
incomplete for NLPs. 

Theorem 4.4 Using only tableau rules (a)-(g), (h%) and (i^), and the extension 
rule does not result in a complete proof system for normal logic programs. 

Proof 

Consider the NLP 11 = {_L <— ~«. a ^ b. b ^ a}. Although 11 is unsatisfiable, 
in the proof system having only the tableau rules (a)-(g),(li§), and (i§), we can 
construct a complete and non- contradictory tableau with a single branch 

T = (n U {F±}, F{~a} (e), Ta (c), T{6} (i§), T6 (g), T{a} (i§)) 

for n. 

Consider an arbitrary set E of extending rules generated using the extension rule 
in E-ASP-T. Recall that head{E) n (atom(n) U {_L}) = 0. We can form a complete 
non-contradictory tableau T' for 11 U i? as follows. 

First, define Hq = atom(n) U {±} and 

Hi = {h€ head{E) \ [J (body(r)+ U body(r)-) c[jHj}. 

rerule(/t) j<i 

Thus the sets Hi are used to define a level numbering for the atoms defined in the 
extension E. Furthermore, we define 

Ei = {rGUuE\ head(r) e [j Hj} 

for all i > 0. Notice that Eq = 11, and 11 U £" = Ui>o-^i- '^^^ show using 
induction that for each z > 0, the only branch B m T can be extended into a 
complete non-contradictory branch for Ei using tableau rules (b)-(g), (h§), and (i§). 

The base case [i = 0) holds by definition. Assume that the claim holds for i — 1, 
that is, B can be extended into a complete non-contradictory branch B' for 
Consider now arbitrary r e i^j. By definition body(r)+Ubody(r)~ C atom(i?i_i) for 
each r € Ei. Since B' is complete, it contains entries for each a G atom(ii^i_i), and 
we can deduce an entry for body(r) using ASP tableau rule (b) or (f) (depending 
on the entries in B'). If the entry T(body(r)) has been deduced, we can deduce Th 
for h = head(r) using (d). Otherwise, we have deduced the entries F(body(r')) for 
every r' e Ei such that h = head(r'), and we can deduce Fh using (h§). Thus 
we have deduced entries for all a € atom(iJj) U body(£'i) and the branch is non- 
contradictory. Furthermore it is easy to check that the branch is closed under the 
tableau rules (b)-(g),(h§), and (i§). 

* Notice that the proof system consisting of tableau rules (a)-(g),(h§), and (i§) amounts to com- 
puting supported models (Gebser and Schaub 2006b). 
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Thus we obtain a complete and non-contradictory tableau for 11 U -E. Since we 
cannot generate a contradictory tableau for 11 with tableau rules (a)-(g),(h§), and 
(i§), we cannot generate one for ILUE either. This is in contradiction with the fact 
that n is unsatisfiable. □ 

5 Proof Complexity 

In this section we study proof complexity theoretic issues related to E-ASP-T from 
several viewpoints: we will 

• consider the relationship between E-ASP-T and the Extended Resolution proof 
system (Tseitin 1969), 

• give an explicit separation of E-ASP-T from ASP-T, and 

• relate the extension rule to the effect of program simplification on proof 
lengths in ASP-T. 

5.1 Relationship with Extended Resolution 

The system E-ASP-T is motivated by Extended Resolution (E-RES), a proof system 
originally introduced in (Tseitin 1969). The system E-RES consists of the resolution 
rule and an extension rule that allows one to expand a set of clauses by iteratively 
introducing equivalences of the form x = li A I2, where a; is a new variable, and h 
and I2 are literals in the current set of clauses. In other words, given a set C of 
clauses, one application of the extension rule adds the clauses {x,liJ2}, {x^h}, 
and {00,12} to C. The system E-RES is known to be more powerful than RES; in 
fact, E-RES is polynomially equivalent to, for example, extended Frege systems, and 
no superpolynomial proof complexity lower bounds are known for E-RES. We will 
now relate E-ASP-T with E-RES, and show that they are polynomially equivalent 
under the translations comp and nip. 

Theorem 5.1 E-RES and E-ASP-T are polynomially equivalent proof systems in 

the sense that 

considering tight normal logic programs, E-RES under the translation comp polyno- 
mially simulates E-ASP-T, and 

considering sets of clauses, E-ASP-T under the translation nip polynomially simu- 
lates E-RES. 

Proof 

(i): Let T be an E-ASP-T proof for a tight NLP U, that is, T is an ASP-T proof 
for UL) E, where E is the set of extending rules generated in the proof. We use 
the shorthand Xi for the variable corresponding to default literal I in comp(nu£'), 
that IS, Xl — Xa {xi = Xa, respectively) ii I = a {I = ~o, respectively) for a G 
atom(n U E). By Theorem 3.2 there is a polynomial RES proof for comp(n U E). 
Now consider comp(n). We apply the extension rule in E-RES in the same order 
in which the extension rule in E-ASP-T is applied when generating the set E of 
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extending rules. In other words, we apply the extension rule in E-RES as follows for 
each rule r = h ^ li,l2 in E. If body(r) = {^1,^2} G body(n), then there are the 
clauses a;^;^ j^} = ^ in comp(n). If this is the case, we generate the clauses 
Xh = X{ii,i2} with the extension rule in E-RES. Otherwise, that is, if body(r) does 
not have a corresponding propositional variable in comp(n), we generate the clauses 
Xh = 2;{ii,;2} ^-nd x^i^^i^y = xi^ A xi^. Denote the resulting set of extending clauses 
by E'. Now we notice that comp(n) U E' = comp(n U E), and therefore the RES 
proof for comp(n U E) is an E-RES proof for comp(n) in which the extension rule 
in E-RES is applied to generate the clauses in E' . 

(ii): Let tt = (Ci, . . . , C„ = 0) be an E-RES proof for a set C of clauses. Let E be 
the set of clauses in tt generated with the extension rule. We introduce shorthands 
for atoms corresponding to literals, that is, a; = (a; = ~aa;) if I = x {I = x) for 
X e var(CUi^). Now, an E-ASP-T proof for nlp(C) is generated as follows. First, we 
add the following rules to nlp(C) with the extension rule in E-ASP-T: 

dx 0'lii0'l2 for 6ach extension x = l\ h I2; (10) 
c <— a; for each literal I G C for a clause C € tt such that C ^ C; and (11) 
Pi <— ci and Pi <— Ci,Pi-i for each C, G tt and 2 <i <n. (12) 

Then, from i = 1 to n — 1 apply the cut rule on Pi in the branch with Tpj for 
all j < i. We now show that for each i the branch with Fpi and Tpj for all j < i 
becomes contradictory without further application of the cut rule. First, deduce Fcj 
from Fpi using the rule (12) for i. One of the following holds for Cj € tt: either 
(a) Ci e C, (b) Ci is a derived clause, or (c) Ci G E. 

(a) If Cj G C we can deduce Tcj from _L <— ~Ci G nlp(C), and the branch becomes 
contradictory. 

(b) If d is a derived clause, that is, d is obtained from Cj and Ck for j, k < i resolving 
on x, then d = {Ck^Cj) \ {x, x}. For all the literals I £ Ci we deduce fa; from the 
rules (11) in the extension. From Tpj and Tpk we deduce Tcj and Tck using the 
rule (12) in the extension for j and k, respectively. Furthermore because we have 
entries fa; for each I in {Ck U Cj) \ {x, x}, we deduce Ta^; and Fax and the branch 
becomes contradictory. Recall that there is a rule c ^ ai for each clause C € n and 
literal I G C either in nlp(C) or in the extension (rules in (11)). 

(c) If d G E, then d is of the form {xjijh}, {x,h}, or {Xjh} for x = h A h- 
For instance, if Ci = {x, then from Ci ^ ^ax and Ci <— ai^ we deduce Ta^; 
and fajj. The branch becomes contradictory as T{a;i,a(2} and tai^ are deduced 
from a rule (10) in the extension. The branch becomes contradictory similarly, if d 
is of the form {x, h, I2} or {x, 12}- 

Finally, consider the branch with Tpi for alH = 1 . . . n — 1. The empty clause C„ in tt 
is obtained by resolving Cj — {x} and Ck = {x} in tt for some j, k < n. Thus we can 
deduce Tcj and Tck from rules (12) for j and k, respectively, and furthermore, Ta^ 
and Fux from Cj ^ ax and Ck <— ~ax, resulting in a contradiction in the branch. 
The obtained contradictory ASP tableau is of Hnear length with respect to tt. □ 
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5.2 Pigeonhole Principle Separates Extended ASP Tableaux from ASP 

Tableaux 

To exemplify the strength of E-ASP-T, we now consider a family of normal logic 
programs {!!„} which separates E-ASP-T from ASP-T, that is, we give an explicit 
polynomial-length proof for n„ for which ASP-T has exponential- length minimal 
proofs with respect to n. We will consider this family also in the experiments re- 
ported in this article. 

The program family {PHP""*"^} in question is the following typical encoding of 
the pigeonhole principle as a normal logic program: 

PHP;:+i = {±^~pi,i,...,~pi,„ I l<i<n + l} U (13) 
{±^Pi,k,Pj,k \ l <i < j <n + l, l<k<n} U (14) 
{Pi,j ^ ~Pi,j- Pij ^ ^Pi,j I 1 < i < n + 1, 1 < J < n}. (15) 

In the program above, pij has the interpretation that pigeon i sits in hole j. The 
rules in (13) require that each pigeon must sit in some hole, and the rules in (14) 
require that no two pigeons can sit in the same hole. The rules in (15) enforce that 
for each pigeon and each hole, the pigeon either sits in the hole or does not sit in 
the hole. Each PHP^"'"^ is unsatisfiable since there is no bijective mapping from an 
(n + l)-element set to an n-element set. 

Theorem 5.2 The complexity o/{PHP^~'~^} with respect to n is 

(i) polynomial in E-ASP-T, and 
(a) exponential in ASP-T. 



Proof 

(i): In (Cook 1976) an extending set of clauses is added to a clausal encoding Cphp of 
the pigeonhole principle^ so that RES has polynomial-length proofs for the resulting 
set of clauses. By Theorem 5.1 (ii) there is a polynomial-length E-ASP-T proof for 

nlp(CpHp) = {Pi,j ^ ^p'ij. p'ij < Pi,j |l<i<n-|-l, l<i<n}U 

{_L <- ~Cj I 1 < i < n -M} U 

{_L ^ r^djk \ l<i<j<n + l,l<k<n}L\ 
{ci Pij |l<j<n, l<i<n-|-l}U 

{cijk ^ '^Pi,k- Cijk •*— ~Pj,fc I 1 < i < i < n -I- 1, 1 < fc < n}. 

For simplicity, we keep the names of the atoms Pij unchanged in the translation. 
In more detail, let tt = (Ci, C2, . . . , = 0) be the polynomial-length E-RES 

® The particular encoding, for which there are no polynomial-length RES proofs (Haken 1985), is 

CpHP = Ul<i<n+l{{Vj = l Pi,]}} U Ul<i<i<„+l,l<fe<„{{^P»,fe V ^Pj,k}}- 
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proof^ for the clausal representation CpHP- Let 

EXT' = {ei^. ^ e[j ^ e\+\ e'+l- | 1 < i < Z and 1 < j < / - 1} 

for 1 <l <n, where each e"^""^ is pi^j. The extension EXT' corresponds the set of 
extending clauses in (Cook 1976) similarly to the set of rules (10) in part (ii) of the 
proof of Theorem 5.1. Furthermore, E(7r) consists of the sets of rules (11) and (12) 
defined in the proof of Theorem 5.1 (ii). By applying the strategy from the proof 
of Theorem 5.1 (ii), we obtain a polynomial-length ASP-T proof for 

nlp(CpHp)U U EXT'uE(7r). 

l</<n 

Now, we use the same strategy to construct a polynomial ASP-T proof for the 
program 

EPHP;^+^ = U IJ EXT' U E'(7r), 

l<l<n 

where E'(7r) consists of rules c <^ ai for each literal / e C for each clause C G tt (that 
is, rules as in (11) but without the restriction C ^ Cphp) together with the rules 
in (12). The only difference comes in step (a) in the proof of Theorem 5.1 (ii), that 
is, when we have deduced Fc corresponding to C G Cphp- Since we do not have the 
rule ± <— ~c in EPHPJJ^^, we cannot deduce Tc to obtain a contradiction. Instead, 
we can deduce a contradiction without using the ASP-T cut rule through a program 
rule in PHP""'"^ that corresponds to the clause C. For instance, if C = {-'Pi,k, ~'Pj,k}, 
we have the rules c ^ ~Pi,fc and c ^ '^Pj.fc in E'(7r) and the rule _L ^ Pi,k,Pj,k in 
PBP^+^. From Fc, we deduce Tpi^k and Tpj^k- From F_L and _L <— Pi^k^Pj,k, we de- 
duce 'F{pi^k-,Pj,k}, and furthermore, from Tpi^k and 'F{pi^k,Pj,k}, we deduce Fpj^k- 
This results in a polynomial-length E-ASP-T proof for PHP"^ . 

(ii): Assume now that there is a polynomial ASP-T proof for PHPJ^'*'^. By The- 
orem 3.2, there is a polynomial T-RES proof for com p(PHP;j+^). Notice that the 
completion comp(PHP"'''^) consists of the clausal encoding Cphp of the pigeonhole 
principle and additional clauses (tautologies) for rules of the form pij ^ ^Pij^ 
p'j^ j -f— ^Pij. It is easy to see that these additional tautologies do not affect the 
length of the minimal T-RES proofs for comp(PHP;j+i). Thus there is a polynomial- 
length T-RES proof for the clausal pigeonhole encoding. However, this contradicts 
the fact that the complexity of the clausal pigeonhole principle is exponential with 
respect to n for (Tree-like) Resolution (Haken 1985). □ 

can also easily obtain a non-tight program family to witness the separation 
demonstrated in Theorem 5.2. Consider the family 

{PHP;:+i U {pi^j ^ pij I 1 < » < n + 1, 1 < j < n}}, 



The polynomial-length E-RES proof for Cphp is not described in detail in (Cook 1976). Details 
on the structure of the RES proof can be found in (Jarvisalo and Junttila 2008). The intuitive 
idea is that the extension allows for reducing PHP5J+^ to PHP"_j^ with a polynomial number 
of resolution steps. 
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which is non-tight with the additional self- loops {pij <— Pij}, but preserves (un) satis- 
fiability of PHP""*"^ for all n. Since the self-loops do not contribute to the proofs 
for PHP"^^, ASP-T still has exponential-length minimal proofs for these programs, 
while the polynomial-length E- ASP-T proof presented in the proof of Theorem 5.2 
is still valid. 

The generality of the arguments used in the proof of Theorem 5.2 is not limited to 
the specific family PHP""*"^ of NLPs. For understanding the general idea behind the 
explicit construction of EPHP^^^, it is informative to notice the following. Instead 
of considering PHPJ^^^, one can apply the argument in the proof Theorem 5.2 
using any tight NLP 11 which represents a set of clauses C for which (i) there is no 
polynomial-length RES proof, but for which (ii) there is a polynomial-length E-RES 
proof . By property (ii) we know from Theorem 5.1 (ii) that there is a polynomial- 
length E-ASP-T proof for H. 

5.3 Program Simplification and Complexity 

We win now give an interesting corofiary of Theorem 5.2, addressing the effect of 
program simplification on the length of proofs in ASP-T. 

Tightly related to the development of efficient solver implementations for ASP 
programs arising from practical applications is the development of techniques for 
simplifying programs. Practically relevant programs are often generated automat- 
ically, and in the process a large number of redundant constraints is produced. 
Therefore eflflcient program simplification through local transformation rules is im- 
portant. While various satisfiability-preserving local transformation rules for sim- 
plifying logic programs have been introduced (see (Eiter et al. 2004) for example), 
the effect of applying such transformations on the lengths of proofs has not received 
attention. 

Taking a first step into this direction, we now show that even simple transfor- 
mation rules may have a drastic negative effect on proof complexity. Consider the 
local transformation rule 

red(n) = n\{r en I head(r) ^ |J (B+ U S") and head(r) 7^ _L}. 

Bebody(n) 

A polynomial-time simplification algorithm red*(n) is obtained by closing pro- 
gram n under red. Notice that we have red*(EPHP;j+^) = PHP;j+^ Thus, by 
Theorem 5.2, red* transforms a program family having polynomial complexity in 
ASP Tableaux into one with exponential complexity with respect to n. 

The rules removed by red* are redundant with respect to satisfiability of the 
program in the sense that red* preserves visible equivalence (Janhunen 2006). The 
visible equivalence relation takes the interfaces of programs into account: atom(n) 
is partitioned into v(n) and h(n) determining the visible and the hidden atoms in II, 
respectively. Programs Hi and 112 are visibly equivalent, denoted by Hi =v 112, if 
and only if v(ni) = v(n2) and there is a bijective correspondence between the stable 
models of Hi and 112 mapping each a G v(ni) onto itself. Now if one defines v(n) = 
atom(red*(n)) = v(red*(n)), that is, assuming that the atoms removed by red* are 
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hidden in IT, one can see that red* (IT) =v H. Hence, even though there is a bijective 
correspondence between the stable models of EPHP"+^ and red*(EPHP;j+^) = 
PHP""*"^, red* causes a superpolynomial blow-up in the length of proofs in ASP-T 
and the related solvers, if applied before actually proving EPHP^"*"^. 

6 Experiments 

We experimentally evaluate how well current stato-of-tho-art ASP solvers can make 
use of the additional structure introduced to programs using the extension rule. For 
the experiments, we ran the solvers ^ smodels (Simons et al. 2002) (version 2.33, a 
widoly used lookahoad solvor), clasp (Gcbscr ct al. 2007) (vorsion 1.1.0, with many 
techniques — including conflict learning — adopted from DPLL-based SAT solvers), 
and cmodels (Giunchiglia et al. 2006) (version 3.77, a SAT-based ASP solver running 
the conflict-learning SAT solver zChaff (Moskewicz et al. 2001) version 2007.3.12 as 
the back-end). The experiments were run on standard PCs with 2-GHz AMD 3200+ 
processors under Linux. Running times were measured using /usr/bin/time. 

First, we investigate whether ASP solvers are able to beneflt from the extension 
in EPHP""*"^. Wc compare the number of decisions and running times of each of 
the solvers on PHP;^+\ CPHP;j+i = PE.Pl+^ U Ui<,<„EXT', and EPHP;j+^ By 
Theorem 5.2 the solvers should in theory be able to exhibit polynomially scaling 
numbers of decisions for EPHP^"'"^. In fact with conflict-learning this might also 
be possible for CPHP"^^ due to the tight correspondence with conflict-learning 
SAT solvers and RES (Beame et al. 2004). The results for n = 10 ... 12 are shown 
in Table 1. Whfle the number of decisions for the conflict-learning solvers clasp 

^ We note that the detailed results reported here differ somewhat from those reported in the 
conference version of this work (Jarvisalo and Oikarinen 2007). This is due to the fact that, for 
the current article, we used more recent versions of the solvers. 



Table 1. Results on PHP;j+\ CPHP;j+\ and EPHP;^+i with timeout (-) of 2 

hours. 









Time (s) 






Decisions 




Solver 


n 


PHp;;+i 


CPHP^+i 


EPHP;;+^ 


PHP;;+i 


CPHP;;+^ 


EPHP;^+i 


smodels 


10 


34.02 


119.69 


8.65 


164382 


144416 





smodels 


11 


486.44 


1833.48 


21.70 


1899598 


1584488 





smodels 


12 






49.28 









clasp 


10 


6.81 


7.29 


10.05 


337818 


216894 


38863 


clasp 


11 


58.48 


45.00 


82.07 


1840605 


882393 


203466 


clasp 


12 


579.28 


509.43 


941.23 


12338982 


6434939 


1467623 


cmodels 


10 


1.60 


1.69 


7.87 


8755 


8579 


12706 


cmodels 


11 


8.20 


8.51 


43.96 


24318 


23758 


42782 


cmodels 


12 


46.33 


54.26 


122.72 


88419 


94917 


88499 
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and cmodels is somewhat reduced by the extensions, the solvers do not seem to be 
able to reproduce the polynomial- length proofs, and we do not observe a dramatic 
change in the running times. With a timeout of 2 hours, smodels gives no answer for 
n = 12 on PHP;j+i or CPHP;;;+\ However, for EPHP^J+i smodels returns without 
any branching, which is due to the fact that smodels' complete lookahead notices 
that by branching on the critical extension atoms (as in part (ii) of the proof of 
Theorem 5.2) the false branch becomes contradictory immediately. With this in 
mind, an interesting further study out of the scope of this work would be the 
possibilities of integrating conflict learning techniques with (partial) lookahead. 

In the second experiment, we study the effect of having a modest number of 
redundant rules on the behavior of ASP solvers. For this we apply the procedure 
ADDRANDOMREDUNDANCY(n, n,p) shown in Algorithm 1. Given a program IT, 
the procedure iteratively adds rules of the form h,l2 to 11, where li,l2 are 

random default literals currently in the program and rj is a new atom. The number 
of introduced rules is p% of the integer n. 



Algorithm 1 ADDRANDOMREDUNDANCY(n, n,p) 

1. For i = 1 to Lioo^-J- 

la. Randomly select li,l2 S dlit(n) such that h ^ 12- 
lb. n := n U {rj <- h,l2}, where r, ^ atom(n) U {_L}. 

2. Return 11 



In Figure 5, the median, minimum, and maximum number of decisions and run- 
ning times for the solvers on AddRandomRedundancy(PHP""'"^, n,p) are shown 
for p = 50, 100, . . . ,450 over 15 trials for each value of p. The mean number of 
decisions (left) and running times (right) on the original PHP^^^ are presented 
by the horizontal lines. Notice that the number of added atoms and rules is hnear 
to n, which is negligible to the number of atoms (in the order of n^) and rules (n^) 
m PHP^+\ For similar running times, the number of holes n is 10 for clasp and 
smodels and 11 for cmodels. The results are very interesting: each of the solvers 
seems to react individually to the added redundancy. For cmodels (b), only a few 
added redundant rules are enougli to worsen its behavior. For smodels (c), the num- 
ber of decisions decreases linearly with the number of added rules. However, the 
running times grow fast at the same time, most likely due to smodels' lookahead. 
We also ran the experiment for smodels without using lookahead (d). This had a 
visible effect on the number of decisions compared to smodels on PHP"'*'"'^. 

The most interesting effect is seen for clasp (a); clasp benefits from the added 
rules with respect to the number of decisions, while the running times stay similar 
on the average, contrarily to the other solvers. In addition to this robustness against 
redundancy, we believe that this shows promise for further exploiting redundancy 
added in a controhed way during search; the added rules give new possibilities to 
branch on definitions which were not available in the original program. However, 
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Fig. 5. Effects of adding randomly generated redundant rules to PHPJ^"*"^. 
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for benefiting from redundancy with running times in mind, optimized lightweight 
propagation mechanisms are essential. 

As a final remark, an interesting observation is that the effect of the transfor- 
mation presented in (Anger et al. 2006), which enables smodels to branch on the 
bodies of rules, having an exponential effect on the proof complexity of a particular 
program family, can be equivalently obtained by applying the ASP extension rule. 
This may in part explain the effect of adding redundancy on the number of decision 
made by smodels. 

7 Conclusions 

We introduce Extended ASP Tableaux, an extended tableau calculus for normal 
logic programs under the stable model semantics. We study the strength of the 
calculus, showing a tight correspondence with Extended Resolution, which is among 
the most powerful known prepositional proof systems. This sheds further light on 
the relation of ASP and prepositional satisfiability solving and their underlying 
proof systems, which we believe to be for the benefit of both of the communities. 

Our experiments show the intricate nature of the interplay between redundant 
problem structure and the hardness of solving ASP instances. We conjecture that 
more systematic use of the extension rule is possible and may even yield perfor- 
mance gains by considering in more detail the structural properties of programs in 
particular problem domains. One could also consider implementing branching on 
any possible formula inside a solver. However, this would require novel heuristics, 
since choosing the formula to branch on from the exponentially many alternatives 
is nontrivial and is not applied in current solvers. We find this an interesting future 
direction of research. Another important research direction set forth by this study 
is a more in-depth investigation into the effect of program simplification on the 
hardness of solving ASP instances. 
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